Dynamically Adjustable Cache Size Based on Application Behavior to Save 
Power 



FIELD OF THE INVENTION 

This invention relates generally to electronic circuits. More particularly, this 
invention relates to reducing average power in cache memory arrays. 

BACKGROUND OF THE INVENTION 
As more electronic circuits are included on a single die, the power dissipated 
by a single die continues to increase. In order to keep the temperature of a single IC 
(integrated circuit) at a reasonable temperature, many techniques have been used to 
cool the IC. For example, elaborate cooling fins have been attached to the substrate 
of ICs. Also, fans have been positioned near a group of IC's to cool them. In some 
cases, liquids have been used to reduce the heat produced by ICs. These solutions can 
be costly and may require a great deal of space, where space is at a premium. If the 
power on ICs can be reduced while still achieving higher levels of integration, the 
cost and area of devices that use ICs may be reduced. 

As the size of microprocessors continues to grow, the size of the cache 
memory that is often included on a microprocessor chip may grow as well. In some 
applications, cache memory may utilize more than half the physical size of a 
microprocessor. As cache memory grows so does power consumption. 

On-chip cache memory on a microprocessor is usually divided into groups: 

one group stores data and another group stores addresses. Within each of these 

groups, cache is further grouped according to how fast information may be accessed. 

A first group, usually called LI, may consist of a small amount of memory, for 
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example 16k bytes. LI usually has very fast access times. A second group, usually 
called L2, may consist of a larger amount of memory, for example 256k bytes, 
however the access time of L2 is slower than LI. A third group, usually called L3, 
may have even a larger amount of memory than L2, for example 4M bytes. The 
memory contained in L3 has slower access times than LI and L2. 

A performance monitor unit (PMU) on a microprocessor monitors, among 
other things, "misses" that occur in cache memory. A "miss" occurs when the CPU 
asks for information from a section of the cache and the information isn't there. If a 
miss occurs in a LI section of cache, the CPU may look in a L2 section of cache. If a 
miss occurs in the L2 section, the CPU may look in L3. 

Generally, LI cache is accessed more often than L2 and L3 cache, and L2 is 
accessed more often than L3. Because L3 is accessed less frequently than LI or L2, 
there may be times when sections of L3 cache are not accessed. 

The sections of L3 memory that are not being accessed may be monitored 
using a PMU. After identifying memory sections that are not being accessed, the 
power to these sections may be shut off. In this way, power may be directed to 
sections of L3 memory that are currently active and power may be shut-off from 
sections that are not accessed. 

Sections of L3 cache memory may be turned off based on the amount of on- 
chip cache memory a software application needs. For example, transaction- 
processing applications often require larger amounts of cache memory as compared to 
engineering applications. Because the amount of on-chip cache memory of a 
microprocessor is fixed, power may be saved by turning off sections of L3 cache that 
aren't needed for certain applications and turning on more sections of L3 cache when 
other applications require it. 
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When a software application is compiled, a signal may be sent from the 
software application to the PMU to indicate how much L3 cache memory the 
application may need. The PMU then turns on the appropriate amount of cache 
memory needed for that application. 

While a software application is running, the application may also send a signal 
to the PMU to indicate how much L3 cache memory the application needs at that 
time. 

The following description of an apparatus and method for appljang power to 
individual sections of L3 cache fills a need in the art to reduce power in ICs and 
computer systems while maintaining performance requirements. 

SUMMARY OF THE INVENTION 

An embodiment of the invention provides a circuit and a method for 
controlling power in individual memory arrays of a cache memory. Individual arrays 
of memory are isolated from a fixed power supply by inserting one or more switches 
between GND and the negative connection of an individual memory section or 
between VDD and the positive connection of an individual memory section. These 
switches are controlled by a performance monitor unit (PMU). If a memory array is 
not accessed for specific length of time, the PMU will detect it and shut off the power 
to that memory section. If an inactive memory array is accessed, the PMU will detect 
the accesses and provide power to the inactive memory array. A software application 
may also provide information to a PMU concerning how much cache memory is 
needed. This invention fills a need to reduce overall power on a microprocessor chip. 
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Other aspects and advantages of the present invention will become apparent 
from the following detailed description, taken in conjunction with the accompanying 
drawing, illustrating by way of example the principles of the invention. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic drawing of cache memory elements 
connected to VDD through switches controlled by a PMU. 

Figure 2 is a schematic drawing of cache memory elements 
connected to GND through switches controlled by a PMU. 

DF.TAn.TCD DESCRIPTION OF THE PREFERRED EMBODIMENT 

Figure 1 shows three cache memory arrays, MAll, MA12, and MA13 
connected to a positive power supply, 102, VDD through three switches, Sll, S12, 
and S13 at nodes 110, 112, and 114 respectively. A PMU, PMUll, is connected to 
memory arrays, MAll, MA12, and MA13 at nodes 116, 118, and 120 respectively 
and to a software application, SAll at node 122. Three outputs from PMU, PMUll, 
104, 106, and 108 control switches Sll, S12, and S13 respectively. 

If, for example, PMU, PMUll, detects that memory array MAll has not been 
accessed for a cert^n length of time, the CPU will flush the data, and PMUll will 
send a signal that opens switch Sll. With switch Sll open, power can not be 
supplied to memory array MAll. If after sometime, memory array MAll has a 
number of cache "misses", PMUll will send a signal that closes switch Sll 
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supplying power to MAll. In this manner PMUll may turn power on or off to any 
memory array based on how often the array is utilized. 

A software application, SAll, may also send a signal to PMUll. The 
software application determines how much cache memory it may need and sends that 
information to PMUll. PMUll will either add or remove cache arrays to meet the 
memory needs of the particular software apphcation by switching on-chip cache 
memory in or out. For example if an application does not require the fiiU on-chip 
cache memory, it will send a signal to PMUll to switch off the appropriate number 
of cache memory arrays. The software application may send the proper signal to 
PMUll while the software application compiles or when the software application is 
running. 

Figure 2 shows three cache memory arrays, MAll, MA22, and MA23 
connected to a negative power supply, 202, GND through three switches, S21, S22, 
and S23 at nodes 210, 212, and 214 respectively. A PMU, PMU21, is connected to 
memory arrays, MA21, MA22, and MA23 at nodes 216, 218, and 220 respectively 
and to software application, SA21 at node 222. Three outputs from PMU, PMU21, 
204, 206, and 208 control switches S21, S22, and S23 respectively. 

If, for example, PMU, PMU21, detects that memory array MA21 has not been 
accessed for a certain length of time, PMU21 will send a signal that opens switch 
S21. With switch S21 open, power can not be supplied to memory array MA21. If 
after sometime, memory array MA21 has a number of cache "misses," PMUll will 
send a signal that closes switch S21 supplying power to MA21. In this manner 
MPU21 may turn power on or off to any memory array based on how often the array 
is utilized. 
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A software application, SA21, may also send a signal to PMU21. The 
software application determines how much cache memory it may need and sends that 
information to PMU21. PMU21 will either add or remove cache arrays to meet the 
memory needs of the particular software application by switching on-chip cache 
memory in or out. For example if an application does not require the ftiU on-chip 
cache memory, it will send a signal to PMU21 to switch off the appropriate number 
of cache memory arrays. The software application may send the proper signal to 
PMU21 while the software application compiles or when the software application is 
running. 

Switches may be implemented with MOSFETs (Metal Oxide Semiconductor 
Field Effect Transistor), bipolar transistors, or any other type of semiconductor 
transistor. 

The foregoing description of the present invention has been presented for 
purposes of illustration and description. It is not intended to be exhaustive or to limit 
the invention to the precise form disclosed, and other modifications and variations 
may be possible in light of the above teachings. The embodiment was chosen and 
described in order to best explain the principles of the invention and its practical 
application to thereby enable others skilled in the art to best utilize the invention in 
various embodiments and various modifications as are suited to the particular use 
contemplated. It is intended that the appended claims be construed to include other 
alternative embodiments of the invention except insofer as limited by the prior art. 
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